<110> Ago, Hideo
Miyano, Masashi
Adachi, Tsuyoshi



SEQUENCE LISTING 



<120> HCV Polymerase Suitable for Crystal Structure Analysis and Method for Using the Enzyme



<130> SHIMQ07 

<140> 09/608,713 
<141> 2000-06-30 

<151> 1999-07-02 

<150> 11-192483
<151> 1999-07-07 



<160> 12 



<170> FastSEQ for Windows Version 4.0 

<210> 1
<211> 551
<212> PRT

<213> Hepatitis C Virus 



<400> 1

Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu Ile Thr Pro Cys Ala Ala
Glu Glu Ser Lys Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg
His His Asn Met Val Tyr Ala Thr 11 Ser Arg Ser Ala Leu ^ 

Gln Lys Lys Val Thr Phe Asp Arg Leu Gln Val Leu Asp Asp His Tyr
Arc, Asp Val Leu Lye Glu Met Lys Ala L y 3 AU H Thr Val Lys AU 
Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His Ser
Ala Lys Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Ser
8er Lys Ala Val Asn His He Hi, S^r Val Tr P L ys Asp Leu Leu Glu 
Asp Thr Val Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val



135 140 



Phe Cys Val Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile



Val Phe Pro Asp Leu Gly Val Arg Val Cys S Lys Met Ala Leu Tyr 

170 17£ 
Asp Val Val Ser Thr Leu Pro Gln Val Val Met



Gly Ser Ser Tyr Gly 



190 



180 ias 
Phe Gin Tyr Ser Pro Gly Gin Arg Val G i u Phe Leu Va1 " ' ^ 

19S 20 0 Thr Tr P 

..r Ly s lys is „ Pro Gly phQ ^ ^ p JOS ^ ^ ^ 
JJP S« Thr v,l Illt oiu A a „ ssp „. , rg val ™ s ^ iu t ^ 
oi. =y. c y8 SSP „„ pro „„ AU flrg J» au n< ^ „„ 

The Glu Ar 9 Leu ly r He Sly Gly Pro Lon ti „ 255 

260 y J™ L ° U Thr fls " s " ly» Gly Gin 

*» cy. Gly Tyr Arg ^ ^ M J Ser Gly ^ 270 ^ ^ 



280 



cy 0 Gly * s » Tllr Leu , hr ^ leu Lys iU sw »» ^ ^ 
»i. «. K. ^ „„ j., ^ Thr Hec leu vil «• my ^ 

™ V.l Il0 Cy S Glu S„ „. G!y Thr 01a ^ Aep ^ 320 

V.X ». Jjj 01. „. Thr fg £ s „ flla 33= ^ 

»™ Gl„ Pro „. Tyf ftap ^ ^ s ^ £ ^ s ^ 



Leu 



V ? l S „ M Ml Hls ^ „, ^ ^ ^ J., ^ ^ 

J£ X,, T ,, r Pro Leu stg am HI ^ 

- 3 Bl. Tto Pro vji A „ See Trp Leu Qly »• ^ ^ „„ 

P« »». «« £p Ala Alg „ sc IU «» ae ^ «. 

««. l0 u G!n G1 U Gl„ 1=u „ M . ^ «0 ^ ^ 

«y «« cy 8 Tyr Sor „. Pro ^ ftap Leu pro «. ne 

Arg Leu Hio Gly Leu Ser AT a Ph» e r . 450 

«6S y Ala Phe Ser Leu Hib Ser Tyr Ser Pro Gly 

Glu He Asn Arq val Ala <?pv t » 475 480 

9 vai Ala Ser Cy 3 L eu Arg Ly e Leu Gly Val Pro Pre 

Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu Leu

Ser Gin Gly Gly Ar 9 Ala Al a TJ c ,, Gly Lyfl ^ ^ ^ 

vai L y, Thr Ws Leu ^ Leu Thr prQ ^ ^ S« ^ ^ ^ 

Leu Asp Leu Scr Glv Tm p)ip Ua i 540 

54 5 7 „J Phe Val Ala G1 V Ser Gly Gly Asp He 

Tyr Hie S ftr Leu Ser Arq ai a a™ p>-, * 555 550 
565 9 Ar9 Trp Phe Met Lcu c ys Leu 

Leu Leu Lou Ser Val Gly Val Gly He t3^ r P1 , T _ 575 
5 3 0 r iy ile Le ^ Leu Pro Asn Arg 

585 530 
<210> 2
<211> 2883 
<212> DNA 

<213> Artificial Sequence 

<220> 
<223> CDS 



5-teS» ns P°ly"«er a ,e and hi 3tid ine tag aC the 



<221> misc_feature




1017, 



<223> n = A,T,C or G



<221> misc_feature

^03^ 152 ' 214 ' 294 ' 



* 1183 ,1249 12 7 0 234 9 ' ^ 7 "' 776 < "07, 1017. 

- "79, 1S24,' 18M % J ; * ' "Ol f l 5fl 8, 

/ 2605, 2634, 2760 ' 5 ' 22AQ ' 2313 ' 



2604 
2445 

<223> n = A,T,C or G 

s»r ni^^r 4 !,?,"-^!' "ja ™ ; »»■ "»- »»• 

»«: llll: lilt: '"«■ »»: »«: 

<223> n = A,T,C or G
<221> misc_feature 

1604, 1579, la24 1B "' 2 o76' "° 5 ' 140 ^ 1501 , l Sfi8( 

2445, 2605, 2634, 276Q ' ^ " 21 ' 2225 ' 22 *°< "13, 

<223> n = A,T,C or G

<221> misc_feature

is;: j'*;,;;'- **:• ™- ™. »». 

"04, 1679 1824 9*' "01, 1588, 

^445, 2605, 2634, 2760 ' 2225 ' 2240 < =313, 

<223> n = A,T,C or G



<400> 2 

atgtcaatgt 

hrtrthrgya 

tettegctga 

aca acaLctc 

ucggcagaag 

thrhasargu 

acagt fcaagt 

agaggaagee 



cetacacatg 
aubhrruysa 
agugusrysu 
geagegcagg 
aaggtcacct 

yrargasvau 
tgcaagctga 



gacaggcgcc 
ageggaggaa 
raanaausra 
cctgarghah 
ttgacagact 
stacegggae 
ysgumtysaa 
cgcccccaca 



tCgatcacgc 
ageaagctge 
snsruuegee 
sasnmtvaty 
gcaagcccng 
gtgetcaagg 
ysaasrthrv 
taayeuusrv 



catgegctmt 
ccatcaacgc 
accataacat 
raathrthts 
gacgaccaca 
agatgaaggc 
aysgctaaac 
aguguaacys 



arrr^Grtyrt 60 
gttgagcaac 12 o 
Sgtttaiigcc 180 
rargsraagy 24 0 
rggnysysva 3 00 
gaaggegtec 3 60 
tcctatccgt 420 
ysuthrrrhs 4 80 
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X£ a™* ^gggg "gcjtcc r ur aay 3Sr y ahg £40 
gacttgctgs rsrysaavaa snhshLrva trye"uuai ° CatCcactc c 9tgtggaag 50 0 
acaccaccat ca tggcaa aa aatgagg^ SvaSS l??^ 9 acaccaaCt 9 
cttccgtgtc caaccagaga aagqaSLa J*™? s thrthr m t a aysasngugt 720 

sgrayargya raaargua? c gJS^S !^ f f C c 9Ccttvahc ysvagnrguy 7 fl Q 

ccctcvahra sugyvLrgv acyegSys*? aaStSK f^ 9 ^ ^aagatgg £,40 
gtcgtgatgg gctececaEa ctyr^va "SrSanv t9gCCtC " c «ttc cCeag 90 0 
CagtactCtc ctgggcagcg agtcgagS C^qaatl ST? 978 " rt ^9-"C 9 50 
vaguhuvaa. nthrtggaaa tcaa'gLa. accc^tS ct?r 8rr9y9nar 9 1020 

gltry 6S ry S ysasnrmtgy hsrtyra-th «2Sf 9 ? 9 cttttcatat gacactcgct 1080 
cgacatccge gttgaggagt ca«t£ B 2 thr?ath™ gactcaac 99 teacegagaa li 40 
"tgttgtga cttggccccc gaagccag^ ^f!^ ^rt.ec 1200 

rguaaarggn aayssretea cagagegget tfatat™ n yscysasuaa 1 260 

oggguthrgu argutyrgyg yruthrSs r^Saa SS^f CtaattCa ™ 132 ° 
gegegagegg cgtgctgacg actgnasncy 'S^l "^"at cgccggtgcc 1380 
thragctgeg gtaacaccc? cacatgtS tS™~? r f ^gaa srgyvauthr !440 
nthruchrcy atyruyaaaa raaaacyscg agJtgSS c^f 9 "' 9 tSrc ^ a * 1500 
c g tgaa C g ga gacgacarga aaaysugnal SstWu.?- CtCC *" act 9«cgatg Ct 1560 
tetgtgaaag cgcggga.cu caagagg^g cSSS^ av^*^ 3 ^^^"a 1620 
uaaaaaasrc tacgagtctt caeggagge? a?fa?5St 3 ^ Cys9usr ^gythrgng 160o 
gvahthrgua amtthrargt yraSaS^y gacccqcccc Tr^T Cccc ^9uar 1740 
etgaLaacaL catgttccas rrgnrguty? asuguuthr' rtlT 9 ^ ^"^9*3 1800 
cgcccacgat gcatcaggca aaaggg C g£ rcy 8srtcca atgtg tcggt la6o 

rgvatyrtyr ctcacccgtg atecScLc asrvaaahaa saasrgyysa i 920 

hrargaarth rthrruaLr gaaaatrgut hracSf* Q ^ ct ^ Sggagacaut l S80 
taggcaaeat tnttatgtat aaarah^ ^ 9Cta9aca ^tccagtt aactcctggc 2040 
ttgtgggeaa ggatga? ct g"gact C ac ttcttcSa ^t*™** ^gcccact 2100 
rh.nhhsracc Cttctagcgc agglqcaact t«™ "thrutraa argmtumtth 2160 

ngugnuguya aauascysfn tacg|ggcct gttactcc^ ^tcuuaag 2220 

agatcattty rgyaacysty rarquruasu tgagecaett gacctacctc 2280 

toccata gtL^cJ a^Eg^Jg yuT^u £1%?°" =340 
atagggtggc ttcatgcctc aggaaacttg ™[ hfi ^tyrsrr ggtgagatca 2400 

gysugyvarc ccttg C g agt ctLagaca? S ^ ^ as »^9va aaarcyauar 24 60 
rgvatrargh sargaaargs rvKrgaaar S ' " 934 9 c 9tccgcgc taggctarua 252 0 
gtggcaagta cetctCcaL usrgngyg^ ""^^ Sccgccactt 2S 60 

gtgaagacca aactcaaact caetc^a^c Y^yyetym hasntgggca 2640 

hrrraaaaar cagctggact tgtccggctg ' ^ thryauy S ut 2700 

uasuargytr hvaaagyty r argyqylsa? 9 9gtta C agcg ggggagacgn 2760 

cggacoccat cacca?tyrt J J 11,1 ^ ct 9t"cgtg cccgaccccg 2820 
taah3h. 3 h 3 USrarga »argrar ggy srhshahsca ccatcactaa 2 8 60" 

28B9 

<210> 3 
<211> 579 
<212> PRT

<213> Artificial Sequence 
<220>

<223> DNA encoding fusion protein consisting of a

<400> 3

M "°s er ^ ser Tyr Thr Trp Thr Gly ^ c ^ ^ 
CU



Ala Glu Glu Ser Lys Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu
Arg His „i. A8a Mec val Tyr ^ ^ a ^ 30^ ^ ^ 

Arg Gln Lys Lys Val Thr Phe Asp Arg Leu Gln Val Leu Asp Asp His
Tyr Ar g Asp Val Leu Lya Glu Me , Lyg AU Ly6 ^ ^ ^ ^ ^ 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His
Ser Ala Lys Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu

Ser Lys Ala Val Asn His jj £ ser Val Xrp Lys Jsp Leu Leu 
Glu Asp Thr val Thr Pro ABp Thr Thr ^ "J ^ ^ ^ 

Val Phe Cys Val Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu
He val Phe Pro Asp Lcu G.ly Val Arg Val Cys al* Lya Met Ala ™ 
Tyr Asp Val Val Ser Thr Leu Pro Gln Val Val Met Gly Ser Ser Tyr
Gly Phe Gln Tyr Ser Pro Gly Gln Arg Val Glu Phe Leu Val Asn Thr
Trp Lys Ser Lys Lys Asn Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys
Phe Asp Ser Thr Val Thr Glu Asn Asp Ile Arg Val Glu Glu Ser Ile
Tyr Gln Cys Cys Asp Leu Ala Pro Glu Ala Arg Gln Ala Ile Lys Ser
Lcu Thr Glu Arg Leu Tyr tie sly G i y ^ Leu Thr ^ ^ ™ ^ 
Gin Asn cys Gly Tyr Arg Ar 9 Cys III Ala Ser Gly Val III Thr Thr 
Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala III A la Ala Cys 
Arg Ala Ala Lys Leu Gln Asp Cys Thr Met Leu Val Asn Gly Asp Asp
Leu Val Val Ile Cys Glu Ser Ala Gly Thr Gln Glu Asp Ala Ala Ser
Lou Arg Val Phe Thr Glu Ala Met Thr Arg Ty r Ser Ala Pro III oiy 
Asp p ro Pro flJn Pro Glu Tyr Asp HI Glu Leu ne Thr HI 

Scr Asn Val Ser val Al a Hi, ™ Ala Ser Gl y Lya Ar g Val T yr Tyr 

3 BO 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
Ma Arg His Thr Pro Val Ann Ser Trp Leu Gl y Asn He xl e Het J£ 
Ala P „ Thr Lcu Trp Ala Arg Met II, III Met Thr His ph£ JJ= ^ 
Ile Leu Leu Ala Gln Glu Gln Leu Glu Lys Ala Leu Asp Cys Gln Ile
Tyr Gly Ala Cys Tyr Ser Ile Glu Pro Leu Asp Leu Gln Ue ^

Glu Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro
Gly Glu Ile Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro
Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu
Lou Ser Gin G ly Gly Arg Ala ^ Gly «0 p ^ ^ 

Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Ile Pro Ala Ala Ser
Gln Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp
Ile Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Gly Ser His His His

His His His 575



30 



<210> 4
<211> 30
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer_bind - Artificially synthesized primer
sequence, 5BNdeIFW

<400> 4 

catatgtcaa tgtcctacac atggacagcc 

<210> 5
<211> 57
<212> DNA 

<213> Artificial Sequence 
<220>

<223> primer_bind - Artificially synthesized primer
sequence, 5B570HRV 

<400> 5

ttattagtga tggtgatggt gatgggatcc gcggggtcgg gcacgagaca ggctgtg 57

<210> 6
<211> 57
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer_bind - Artificially synthesized primer
sequence, 5B552HRV






<400> 6

ttattagtga tggtgatggt gatgggatcc aacgaaccag ccggacaagt ccagctg 57 

<210> 7 
<211> 57 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer_bind - Artificially synthesized primer
sequence, 5B544HRV 

<400> 7

ttattagtga tggtgatggt gatgggatcc ctgggacgca gccgggattg gagtgag 57

<210> 8

<211> 67
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer_bind - Artificially synthesized primer 
sequence, 5B536HRV

g^agagf 93 ********* gatgggatcc gagtttgagt ttggtcttca ctgcccagtt 60 

(57 

<210> 9 
<211> 60
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer_bind - Artificially synthesized primer
sequence, 5B531HRV 

<400> 9

ttattagtga tggtgatggt gatgggatcc cttcactgcc cagttgaaga ggtacttgcc 60

<210> 10 
<211> 52
<212> DNA 

<213> Artificial Sequence 
<220>

<223> primer_bind - Artificially synthesized primer
sequence, 5B591HRV 

<400> 10

ttattaatgg tgatggtgat ggcgcccgga tcgattgggg agcaggtaga tg 52
<210> 11 
<211> 8
<212> PRT

<213> Hepatitis C virus 
<220> 

<221> VARIANT 

<223> Xaa = Any Amino Acid
<400> 11

Xaa Asp Leu Ser Gly Trp Phe Xaa

1 5 

<210> 12
<211> 8

<212> PRT
<213> Hepatitis C virus

<400> 12 

Leu Asp Leu Ser Gly Trp Phe

5 